LASIGE: using Conditional Random Fields and ChEBI ontology

نویسندگان

  • Tiago Grego
  • Francisco R. Pinto
  • Francisco M. Couto
چکیده

For participating in the SemEval 2013 challenge of recognition and classification of drug names, we adapted our chemical entity recognition approach consisting in Conditional Random Fields for recognizing chemical terms and lexical similarity for entity resolution to the ChEBI ontology. We obtained promising results, with a best F-measure of 0.81 for the partial matching task when using post-processing. Using only Conditional Random Fields the results are slightly lower, achieving still a good result in terms of Fmeasure. Using the ChEBI ontology allowed a significant improvement in precision (best precision of 0.93 in partial matching task), which indicates that taking advantage of an ontology can be extremely useful for enhancing chemical entity recognition.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

IBEnt: Chemical Entity Mentions in Patents using ChEBI

This article presents our approach to the CEMP task of BioCreative V.5, which consisted in using our system, IBEnt, to identify chemical entity mentions in patents through machine learning and semantic similarity techniques. The features used combine the results of a CRF classifier, two lexical matching methods (FiGO and MER) and semantic similarity measures on ChEBI ontology. We also tested th...

متن کامل

Identifying chemical entities in patents using brown clustering and semantic similarity

This paper presents the system we developed for the CHEMDNER task of BioCreative V. This system was adapted from the IICE framework, which combines Conditional Random Fields, implemented by Stanford NER, brown clustering, implemented by Percy Liang’s Cbased algorithm and a semantic similarity based on the h-index concept, applied to the ChEBI ontology. For the CEMP subtask, we obtained a maximu...

متن کامل

Improving chemical entity recognition through h-index based semantic similarity

BACKGROUND Our approach to the BioCreative IV challenge of recognition and classification of drug names (CHEMDNER task) aimed at achieving high levels of precision by applying semantic similarity validation techniques to Chemical Entities of Biological Interest (ChEBI) mappings. Our assumption is that the chemical entities mentioned in the same fragment of text should share some semantic relati...

متن کامل

Chemical compound and drug name recognition using CRFs and semantic similarity based on ChEBI

This document presents our approach to the BioCreative IV challenge of recognition and classification of drug names (CHEMDNER task). We developed a system based on Conditional Random Fields for recognizing chemical terms, and on ChEBI resolution and semantic similarity techniques for validating the recognition results. Our system created multiple classifiers according to different training data...

متن کامل

A Quality-Assurance Study of ChEBI

Ontologies are important components of many health-information systems. The Chemical Entities of Biological Interest (ChEBI) ontology has become a standard reference for chemicals appearing in biological contexts. As such, assuring the quality of its content is imperative. In fact, ChEBI has a dedicated Web page at which errors and inconsistencies in its concepts can be reported. A study of the...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013